2025.09.15 | An upgraded dataset measures engagement; model size is not the long-horizon bottleneck
Description
This episode covers the following 14 papers:
[00:25] 📚 IntrEx: A Dataset for Modeling Engagement in Educational Conversations
[01:02] 📏 The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs
[01:54] 🧩 X-Part: high fidelity and structure coherent shape decomposition
[02:33] 🖼 InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis
[03:04] 🔍 HANRAG: Heuristic Accurate Noise-resistant Retrieval-Augmented Generation for Multi-hop Question Answering
[03:50] 🎙 VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions
[04:44] 🌸 FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies
[05:20] 🎨 Inpainting-Guided Policy Optimization for Diffusion Large Language Models
[05:58] 🤖 Virtual Agent Economies
[06:28] 📈 QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading
[07:02] 🧪 MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools
[07:41] 🎨 Color Me Correctly: Bridging Perceptual Color Spaces and Text Embeddings for Improved Diffusion Generation
[08:31] 🦎 LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
[09:13] 🗞 CMHG: A Dataset and Benchmark for Headline Generation of Minority Languages in China
<figure>
[Follow us]
You can also find us on the following platforms for more information beyond the podcast:
Xiaohongshu: AI速递